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ABSTRACT 

Dark matter haloes are biased tracers of the underlying dark matter distribution. 
We use a simple model to provide a relation between the abundance of dark matter 
haloes and their spatial distribution on large scales. Our model shows that knowledge 
of the unconditional mass function alone is sufficient to provide an accurate estimate 
of the large scale bias factor. Then we use the mass function measured in numerical 
simulations of SCDM, OCDM and ACDM to compute this bias. Comparison with 
these simulations shows that this simple way of estimating the bias relation and its 
evolution is accurate for less massive haloes as well as massive ones. In particular, we 
show that haloes which are less/more massive than typical A/* haloes at the time they 
form are more/less strongly clustered than formulae based on the standard Press- 
Schechter mass function predict. 
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1 INTRODUCTION 

There has been considerable interest recently in developing 
models for the shape and evolution of the mass function of 
collapsed dark matter haloes (Press & Schechter 1974; Bond 
et al. 1991; Lacey & Cole 1993) as well as for the evolution of 
the spatial distribution of these haloes (Mo & White 1996; 
Catelan et al. 1997; Sheth & Lemson 1999). In these models, 
the haloes are biased tracers of the underlying dark matter 
distribution. In general, this bias depends on the halo mass, 
and, for a given mass range, it is a nonlinear and stochastic 
function of the underlying dark matter density field. 

The shape of the mass function (and its evolution) pre- 
dicted by these models is in reasonable agreement with what 
is measured in numerical simulations of hierarchical clus- 
tering from Gaussian initial conditions (e.g. Lacey & Cole 
1994). Although this agreement is by no means perfect (see 
Fig. H below), to date, the emphasis has been on how well 
the models fit the simulations (but see Tormen 1998). On 
the other hand, recent work has shown that, while the model 
predictions for the bias relation are in reasonable agreement 
with what is measured in numerical simulations for massive 
haloes, less massive haloes are more strongly clustered, or 
less anti-biased, than the models predict (Jing 1998; Sheth 
& Lemson 1999; Porciani, Catelan & Lacey 1998). 

In this paper, following a suggestion by Sheth & Lem- 
son (1999), we argue that this discrepancy between the bias 
model and simulation results arises primarily because the 
model mass functions are different from those in the simula- 
tions. In Section |^ we provide a simple relation between the 



mass function and the large scale bias factor. We then use 
the mass function measured in the simulations to show that 
this relation is accurate. Section ^ shows that our model pro- 
vides a reaonably good fit to the bias relation of less massive 
haloes as well as massive ones. 



2 MASS FUNCTIONS AND LARGE SCALE 
BIASING 

Let f(m, S) dm, where 8 = S(z) is a function of redshift z, 
denote the fraction of mass that is contained in collapsed 
haloes that at z have mass in the range dm about m. The 
associated unconditional mass function is 



n(m, S) dm = — }{m, 5) dm 



(1) 



where p is the background density. Let /(mi, <5i|Mo, So) de- 
note the fraction of the mass of a halo Mo at zo that was in 
subhaloes of mass mi at zi, where zi > zo- The associated 
conditional mass function is 



AT(1|0) = ^f(m 1 ,8 1 \Mo,S ). 
mi 



(2) 



Finally, let N(mi, 5i\M, V, zo) denote the average number of 
(mi,5i) haloes that are within cells of size V that contain 
mass M at zq. The halo bias relation is defined by 



<5h(l|0) 



Njm^S^M, V,zq) 
n(mi, Si) V 



(3) 
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Since M/V = p(l + S), this expression provides a relation 
between the overdensity of the halo distribution and that of 
the matter distribution. To compute this quantity, we need 
models for the numerator and the denominator. This can be 
done as follows. 

An Mq halo which collapsed at zo initially occupied a 
certain volume Vo in the initial Lagrangian space. Let So 
denote the initial overdensity within this Vo- If \8o\ <C 1, 
then Mo = p(l + So)Vo ~ p Vo- So the mean Lagrangian 
space bias between haloes and mass is given by setting 
N(mi,8i\M ,V,z ) = A/"(1|0) and V = V in equation 
above. Thus, 



rlvilnN _ N(mi,5i\Mo,Zo) 



#(1|0) = 



n(mi,6i) Vo 



(4) 



This expression was first derived by Mo & White (1996). It 
can be expanded formally as a series in So: 

5i(l\0)=J2 b k(mi,5i)5l (5) 

k 

in which form it will appear later. 

To compute the bias relation in the evolved Eulerian 
space they suggested the following procedure. Continue to 
assume that N(mi, <5i[Mo, V, Zo) can be written as some 
A/"(1|0), but now provide a new relation between the Eule- 
rian variables Mo, V, and zo, and the Lagrangian variables 
Mo, Vo and So. For any given zo, Mo & White used the 
spherical collapse model to write 8o(Mo/pV). Recall that 
Mo I pV = (1 + 5). Thus, So is a function of S, and, in general 
8o(S) is nonlinear. However, when \S\ <C 1, then So ~ S. On 
large scales in Eulerian space, the rms fluctuation about the 
mean density is small. Therefore, on sufficiently large scales, 
\S\ <g 1 almost surely. So, on large scales in Eulerian space, 



*h(l|0) 



N(mi,8i\M,V,z Q ) 1 
n(mi, Si) V 

(1+8) [l + d£(l|0)] -i 

[l + 6 La g(mi,5i)] 8, 



(6) 



where the final line follows from using equation (|5|), setting 
So ~ 8, and so expanding to lowest order in 5 (since \8\ <C 1), 
and writing &L ag instead of b\ . This provides a simple linear 
relation between 8^ and 8, where the constant of propor- 
tionality is 



&Eui(mi,5i) = 1 + &Lag(mi,<Si). 



(7) 



Furthermore, Mo & White (1996) argued that one can 



write 



(<5h) 2 ^) v ~ &!ui(mi,(5i) (s 2 ^j , 



(8) 



where the average is over cells of size V placed randomly 
in Eulerian space. This expression neglects the effects of 
stochasticity on the bias relation (see Sheth & Lemson 1999 
for details). The left hand side is the volume average (over 
cells of size V) of the halo-halo correlation function, whereas 
(8 2 ) is the volume averaged correlation function of the dark 
matter. Thus, in this limit, the ratio of the volume averaged 
correlation functions gives the square of the Eulerian bias 
factor. 

Now, &Eui( m lj^i) is a function of halo mass only; it 
does not depend on scale. So, in this large scale limit, the 
ratio of the volume averaged correlation functions is scale 



independent. Therefore, the ratios of the correlation func- 
tions themselves should also be b| ul , independent of scale. 
However, correlation functions and their volume integrals 
are just differently weighted integrals over the associated 
power spectrum. If &e u i is independent of scale on large 
scales, then the ratio of the halo and dark matter power 
spectra should also be b% ui , independent of wavenumber k 
for small wavenumbers. Thus, in the large scale limit, we 
expect the bias factor to be the same, regardless of whether 
it was computed using the counts-in-cells method, the cor- 
relation functions themselves, or power spectra. Moreover, 
in this limit, the Eulerian bias is trivial to compute if the 
Lagrangian space bias is known. 

The Lagrangian bias factor is different for haloes of dif- 
ferent masses. The work of Mo & White (1996) shows that 
the exact form of this dependence can be computed provided 
that both the conditional and the unconditional mass func- 
tions are known (equation ^|). Our contribution is to add 
one more simple step to the argument. Namely, on the large 
scales discussed above, the peak background split should 
be a good approximation (e.g. Bardeen et al. 1986; Cole 
& Kaiser 1989). In this limit, when expressed in appropri- 
ate variables, the conditional mass function is easily related 
to the unconditional mass function. Therefore, on the large 
scales where the peak background split should apply, the 
bias relation can be computed even if only the unconditional 
mass function is known. 

To summarize: we have used the peak background split 
to argue that the dependence of &Lag (and so &e u i also) on 
halo mass should be sensitive to the shape of the uncondi- 
tional mass function; different models for the unconditional 
mass function will produce different bias relations. In par- 
ticular, if the unconditional mass function is different from 
the one in simulations, the predicted bias will also be differ- 
ent. In the next section we will use the unconditional mass 
function measured in simulations as input to the formulae 
above to compute the associated halo to mass bias relation. 



3 THE SIMULATIONS 

In what follows, we will study the halo distributions which 
formed in what are known as the GIF simulations, which 
have been kindly made available by the GIF/ Virgo collab- 
oration (e.g. Kauffmann et al. 1998a). We will show results 
for three choices of the initial fluctuation distribution be- 
longing to the cold dark matter family: a standard model 
(SCDM: Q = 1, O a = 0, h = 0.5Q), an open model (OCDM: 
Slo = 0.3, £Ia = 0, h = 0.7) and a flat model with non- 
zero cosmological constant (ACDM: flo = 0.3, S1a = 0.7, 
h — 0.7). These simulations were performed with 256 3 par- 
ticles each, in a box of size L = 85 Mpc/ft for the SCDM 
run, and L = 141 Mpc//i for the OCDM and ACDM runs. 
The models and resolution are similar to those presented by 
Jing (1998). 

We measure mass functions in the simulations in the 
usual way, using a spherical overdensity group finder (see 
Tormen 1998 for details). Previous authors have computed 
the large scale bias factor by using the ratio of the volume 



The Hubble constant is H = WOh km s _1 Mpc _1 
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Figure 1. The square root b(k) of the ratio of the power spectra 
of dark matter haloes to that of the dark matter, computed at 
the time the haloes were identified: z = 0, z = 2 and z = 4 for 
the top, middle and bottom rows, respectively. The different line 
types in each panel correspond to haloes of different mass ranges 
(mass bins increase in size by factors of two). The more massive 
haloes at a given redshift typically have larger values of b(k). 



averaged halo and mass correlation functions, obtained us- 
ing a counts-in-cells procedure (Mo & White 1996; Sheth 
& Lemson 1999). Others have used the ratio of the correla- 
tion functions directly (Jing 1998; Porciani, Catelan & Lacey 
1998). Here, we will compute the bias factor by measuring 
the ratio of the power spectra of the haloes and the dark 
matter. On the large scales on which the peak background 
split should apply, all these measures should be equivalent. 
Indeed, the extent to which these procedures all agree is a 
measure of the linearity of the large scale bias relation (equa- 
tion ^|), and the accuracy of the assumption that the effects 
of stochasticity on the bias relation are trivial to estimate. 

We will begin our comparison with simulations by show- 
ing the ratios of the halo and dark matter power spectra. 
The various panels in Fig. [jjshow the square root of b 2 (k) = 
fhaio(fc)/fmatter(fc) for haloes identified at Zf orm = 0, 2, and 
4, at which time both Phaio(fc) and Pmatter(fc) were com- 
puted. The various line types in each panel correspond to 
haloes with mass in different mass ranges. The lowest mass 
halo we consider has twenty particles, and the mass bins 
increase in size by factors of two. Thus, the lower limit of 
the mth bin is at 10 x 2 m particles per halo (m > 0). One 
consequence of this is that the highest redshift outputs of 
the GIF simulations are only able to probe high values of 
b(k), whereas lower redshift outputs primarily probe lower 
values of b(k). 

An optimist would argue that the figure shows that b(k) 
is approximately independent of k for small k, though this 
approximation is more accurate when b(k) ~ 1 than other- 
wise. At a given redshift, the small k value of b(k) increases 
with halo mass. If the model presented in the previous sec- 



tion is correct, then we should be able to use the shape of 
the halo mass function to model this dependence accurately. 
We turn, therefore, to the shape of the halo mass function 
in the GIF simulations. 

The shape of the unconditional mass function is ex- 
pected to depend on the initial fluctuation distribution (e.g. 
Press & Schechter 1974). If the initial distribution is Gaus- 
sian with a scale free spectrum, then the unconditional mass 
function can be expressed in units in which it has a univer- 
sal form that is independent of redshift and power spectrum. 
Namely, let v = [S c (z)/a(m)] 2 , where S c (z) is that critical 
value of the initial overdensity which is required for collapse 
at z, computed using the spherical collapse model, and cr(m) 
is the value of the rms fluctuation in spheres which on aver- 
age contain mass m at the initial time, extrapolated using 
linear theory to z. For example, in models with £7o = 1 and 
A = 0, 8 c (z) = 1.68647, and a(m) oc (1 + z) -1 . Then, 
for initially scale free spectra in models with flo = 1 and 
A = 0, 



2 n(m, z) dlogm 
p d log v 



(9) 



has a universal shape (Press-Schechter 1974; Peebles 1980; 
Bond et al. 1991). It is not obvious that this scaling holds 
for more general initial power spectra, such as those of the 
GIF simulations. 

The different symbols in Fig. ^ show the unconditional 
mass function at different output times in the GIF simula- 
tions plotted as a function of the scaled variable v. The dot- 
ted line shows the Press-Schechter formula for v f(v); note 
the discrepancy at both high and low v. The dot-dashed 
line, which fits the data slightly better than the dashed line, 
is the mass function computed using the Zeldovich approx- 
imation by Lee & Shandarin (1998) (see Appendix A). The 
solid line through the data points shows a modification to 
the Press-Schechter function. It has the form: 



uf(y) = A 



1/2 



y'/2 



(10) 



where v = au, with a = 0.707 and u = (<5 c /cr) 2 as described 
in the previous paragraph, p — 0.3, and A is determined by 
requiring that the integral of f(y) over all v give unity. The 
original Press-Schechter formula has a = 1, p = 0, and 
A = 1/2. 

Recall that, in principle, the shape of v f(v) can depend 
on the shape of the initial power spectrum, and on redshift. 
Since the different symbols in each panel overlap each other 
reasonably well, Fig. ^| shows that, in the GIF simulations at 
least, it is a good approximation to say that the mass func- 
tions scale similarly to the scale free case. So, although in 
principle we could have allowed a(z, Qq, Ao) and p(z, Qq, Ao) 
in equation ( p"o| ) (in fact, we could even have allowed a dif- 
ferent functional form for each output time of each model), 
Fig. |2] shows that, in practice, equation ( JTo) ) with the same 
value of p provides a reasonably good description of the mass 
function at all output times. This suggests that the dynam- 
ics of collapse is sensitive to v, and not to the mass scale 
itself. That the same value of p is a good approximation to 
v f(u) in all three panels is presumably a consequence of 
the fact that the shapes of the initial power spectra in these 
models are not very different. Therefore, in what follows, 
rather than finding best fit shapes to v f(u) at each output 



© 0000 RAS, MNRAS 000, 000-000 



4 Ravi K. Sheth & Giuseppe Tormen 



10 • 



1 ^ 




10 11 10 12 10 13 10 11 10 12 10 13 10 11 10 12 10 13 10 14 
M[M /h] M[M /h] M[M Q /h] 

Figure 3. The large scale Eulerian bias relation at z t, s = Zf orm between haloes which are identified at Zf orm , and the mass. Solid curves 
show the relation we predict using the GIF mass function, dotted curves show the relations which follow from the Press-Schechter mass 
function, and dot-dashed curves show the relations associated with the Zeldovich mass function. 



time of each model, we will simply use this one function with 
the given constants a and p. 

This fortuitous rescaling of the mass function is use- 
ful because it means that we need compute the peak- 
background split limit associated with the unconditional 
mass function only once, rather than having to compute it 
for each output time of each model. Simply set v — > v\q in 
the expression above, where 



[8 c ( Zl ) - S c (z )] 2 (S 1 -S ) 2 



V 5i 



2 (mi) 



2 (Mo) 



a(mi) scaled 
2 , and 



require 



with 5i = (5 c (Sli), where D.i = D.(zi), °i 5 
using linear theory to z\ and v\ = (Si/ai 
that /(mi|Mo)dmi = / (zao) d^io ■ 

Once the unconditional and conditional mass functions 
are known, the peak background split bias in Lagrangian 
space is easy to compute. It is 



*h(l|0) 



1 + 



2p 



1 + (aui) 



5o 
Si 



: &Sg(mi,(5i) S , (11) 



where the final expression defines 6 Lag • When a — 1 and 
p — the mass function has the Press-Schechter form, and 
this formula reduces to the one given by Cole & Kaiser 
(1989) and Mo & White (1996). However, the GIF mass 
function has a — 0.707 and p = 0.3. Since v\ increases with 
increasing halo mass, the final term in the expression above 
is negligible for massive haloes. In this limit, the expression 
above resembles the original Press-Schechter based formula, 



except for the factor of a. Since a < 1, our formula predicts 
that massive haloes are slightly less biased than the origi- 
nal Press-Schechter based formula predicts. For less massive 
haloes the extra term cannot be ignored. It is positive for 
all v\, so our formula predicts that less massive haloes are 
more positively biased (or less anti-biased) than the Mo & 
White formula suggests. The bias in Eulerian space is got 
by substituting this expression into equation (jjj): 

2p/<5i 



&ES"(mi,(5i) = 1 



av\ — 1 



+ 



1 + (av x y 



(12) 



Both the trends discussed above remain true in Eulerian 
space. 

The different symbols in Fig. ^| show the large scale bias 
relation for haloes that are identified at Zform and are ob- 
served at ;z bs = zform- That is, they show the small k values 
of b 2 (k) obtained as the value predicted at k = 2-7r/L by a 
linear least-square fit to the 12 left-most data points. Error 
bars show the formal least-square error. In this way we tried 
to estimate the actual large-scale bias even when b(k) on the 
largest scales probed by the simulations shows a dependence 
on k. One might argue that the weak scale dependence of 
b(k) in Fig. [j means that the symbols in Fig. ^| probably 
slightly under/overestimate the true large scale (small k) 
value for small/ large values of b(k). 

Each panel shows four sets of curves, corresponding to 
haloes which formed at Zf OI = 4, 2, 1, and 0. The three 
curves for each redshift show how the bias relation com- 
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Figure 2. The unconditional mass functions from five different 
output times (filled triangles, open triangles, open squares, filled 
circles, open circles show results for z = 0, z = 0.5, z = 1, z = 2 
and z = 4) in the GfF simulations plotted as a function of the 
scaled variable v. Dotted curve shows the Press-Schechter predic- 
tion, dot-dashed curve shows the mass function associated with 
the Zeldovich approximation, and solid curve shows our modified 
fitting function. 



puted using our peak background split model depends on 
the unconditional mass function. Solid and dotted curves 
show equation (|l|) for the GIF (a = 0.707 and p = 0.3) 
and the Press-Schechter (a = 1, p = 0) mass functions, re- 
spectively. The dot-dashed curves show the bias associated 
with the Zeldovich mass function (equation A3). As with 



the mass function, the dot-dashed curves fit the data better 
than the dotted curves, and slightly worse than the solid 
ones. The solid curves fit the data reasonably well for all 
masses and redshifts. 

There are small discrepancies between our GIF based 
predictions and the symbols at large and small b(M). Since 
they are in the sense discussed earlier, we suspect some of the 
discrepancy arises from the weak scale dependence of b(k). 



Figure 4. The large scale Eulerian bias relation of the previous 
figure, rescaled as indicated by the axis labels. Solid curves show 
the relation we predict using the GIF mass function, dot-dashed 
curves show the corresponding relation predicted using the Zel- 
dovich mass function, and dotted curves show the relation which 
follows from the Press-Schechter mass function. 



Since these discrepancies are small, and since our measured 
values of b(M) at small M when z — are very similar 
to those in Fig. 3 of Jing (1998), it appears that using the 
ratio of the power spectra to define 6e u i gives nearly the 
same result as using the ratio of the correlation functions. As 
discussed at the end of the previous section, this is expected 
if the bias is independent of scale. 

Suppose that the unconditional mass function v f(v) re- 
ally was independent of Zf or m (recall that our Fig. || shows 
this is a good approximation). Then our peak background 
split argument says that, after appropriate transformations, 
it should be possible to present the large scale bias relation 
as function of v only. Namely, equation ( |l^ ) suggests that 
we should be able to scale all the results for the different 
•Zform above to one plot of the product (Si &Lag) versus v. 
Fig. ^ shows the result. The symbols show the simulation 
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Figure 5. The square root of the ratio of the halo and matter 
power spectra for haloes that formed at Zf OTm = 4 (top panels) 
and 2f orm = 2 (bottom panels) but were observed at z b s = 0. 



data from the same output times as before. They were ob- 
tained by transforming the large scale Eulerian bias relation 
as indicated by the labels on the axes. The curves show the 
theoretical predictions for the three different mass functions 
we have been considering. 

There are two points to be made about this plot. The 
first is that our peak background split model (solid curves) 
provides a reasonable description of the results. The main 
reason for including this plot is to show that the scatter 
about the solid curves in our plots is approximately the same 
as the scatter about the mass function fits in Fig. ^. As wc 
argued above, we have no a priori reason to expect such a 
scaling if our relation between the unconditional mass func- 
tion and the Eulerian bias were not accurate. 

Finally, we can also study the case in which haloes form 
at high redshift, but are observed at lower redshift: Zf ovul > 
z bs- In this case, the large scale Eulerian bias should be 
described by our equations (Q) and (^), but with 

_ <5c(fiform) , - _ D(z obs ) x ,„ v 

V\ = — — r~ and 01 = TT7 r Oc("fortn) 

o- 2 (m) D(z form ) 

where D(z) is the linear theory growth factor, and a(m) is 
scaled using linear theory to its value at Zf orm . 

Fig. |E] shows the ratio of the halo and matter power 
spectra for haloes which formed at Zf orm > but were stud- 
ied at 2 bs = 0. Thus, it is similar to Fig. |l], except that there 
Zobs = Zform- There are a few obvious differences: some of 
the b(k) curves are not particularly flat, even on the largest 
scales. Those curves that do asymptote nicely (at small k) 
do so at a value of b(k) that is closer to unity than when 
Zobs = Zform- Moreover, the scale dependence of b(k) is dif- 
ferent from before: whereas previously b(k) for these haloes 
increased with increasing k, now it decreases. 

Estimating the large-scale value of b(k) by a least-square 
fit as done for Fig. ^| yields the symbols shown in Fig. ^| 
Most of the symbols are well fit by our simple formula for 
the large scale bias relation. The symbols that are not well 
fit by our formula (typically those in the highest mass bins) 
almost always correspond to haloes in a mass range in which 
the power spectra were not particularly flat. We conclude, 




10 11 10 12 10 13 10 14 
M[M Q /h] 

Figure 6. The large scale Eulerian bias relation at z Q b s = 
for haloes that formed at Zf or m- Solid curves show the relation 
we predict using the GIF mass function, dotted and dot-dashed 
curves show the relations which follow from the Press-Schechter 
and Zeldovich mass functions, respectively. 

therefore, that our formula for the bias relation is reasonably 
accurate in this two epoch context as well. 



4 DISCUSSION 

Collapsed haloes are biased tracers of the dark matter distri- 
bution. This bias depends on halo mass. In general, to quan- 
tify this bias requires a detailed knowledge of the merger his- 
tories of dark matter haloes. We used the peak background 
split to argue that, on large scales, this detailed information 
is unnecessary: knowledge of only the shape of the uncon- 
ditional mass function is sufficient to compute a good ap- 
proximation to the large scale bias relation. This fact was 
anticipated by Sheth & Lemson (1999), but seems to have 
been unnoticed before. 

For example, our approach is different from, but consis- 
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tent with, the recent work of Jing (1998) and Porciani, Cate- 
lan & Lacey (1998). Following Sheth & Lemson's demon- 
stration that the bias in Lagrangian space was not well de- 
scribed by standard models, Porciani et al. quantified the 
discrepancy. They showed that using equation (Q) to trans- 
form Jing's (1998) fitting formula for the Eulerian bias pro- 
vides a reasonably good fit to the Lagrangian bias factor. In 
other words, their results show that equation (Q) is relatively 
accurate. They then provided fitting functions for the La- 
grangian bias relation. However, since we have argued that 
the Lagrangian bias is related to the halo mass function, we 
have chosen to provide a fit to the mass function instead. 

In so doing, we showed that a simple modification 
(equation |l(j) of the Press-Schechter formula provides a 
good fit to the unconditional halo mass function in the 
SCDM, OCDM and ACDM models we studied (Fig. g). 
This allowed us to derive a single, simple, physically moti- 
vated formula for the large scale bias relation (equation |l2| ) 
that can be used for haloes of all masses at all times in 
all three cosmologies. Comparison of the predictions of our 
model with numerical simulations showed good agreement. 
We showed that this was true for the case in which haloes 
are observed at the same time as when they are first iden- 
tified (z bs = Zform), and also for the case in which haloes 
form at zf orm , evolve, and are observed at some later time 

^obs ^ -2-form ■ 

If haloes are significantly more/less massive than the 
typical M* halo at the time of formation, then our formulae 
predict that they should be less/more biased than results 
based on the Press-Schechter formula would predict. Since 
galaxies are thought to form in small, i.e., M/M*(2f orm ) < 1, 
haloes (Kauffmann et al. 1998b; Baugh et al. 1998), our re- 
sults are useful for galaxy formation models. The first gen- 
eration of stars and high redshift quasars are thought to be 
associated with haloes having M/M, (zf orlI1 ) 3> 1 (Haehnelt, 
Natarajan & Rees 1998; Haiman & Loeb 1998), so our re- 
sults (for the number density as well as the bias factor) may 
also be useful for studies of the reionization history of our 
Universe. 

Although we showed how the bias relation depends on 
the unconditional mass function (for example, the bias re- 
lation associated with the simple Press-Schechter spherical 
collapse mass function is different from that associated with 
the Zeldovich mass function), we did not provide a deriva- 
tion of the shape of the mass function itself. Since the mass 
function in the GIF simulations is different from those pre- 
dicted by simple applications of the spherical collapse model 
or by the Zeldovich approximation, our model is still far 
from complete. We hope that our results will stimulate work 
aimed at rectifying this. 

In this regard, we think it important to stress the fol- 
lowing fact. Bond et al. (1991) used a random walk barrier 
crossing model to derive the Press-Schechter unconditional 
mass function from the statistics of the initial fluctuation 
field. In their model, the Press-Schechter mass function is 
associated with the first crossing distribution of a barrier of 
constant height. The barrier height is constant because, in 
the spherical collapse model, there is a critical overdensity 
(5 C required for collapse, and this value is independent of the 
mass of the collapsed object. If this same barrier crossing 
model is to yield the GIF mass function, then the barrier 
shape must depend on mass. We have solved for this barrier 



shape, and are in the process of developing and testing the 
detailed predictions of such a moving barrier model further. 
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APPENDIX A: LARGE SCALE BIAS AND THE 
ZELDOVICH APPROXIMATION 

The universal unconditional mass function associated with 
the Zeldovich approximation is given by equation (g) with 

vf[y) = Z^iy') + Z h (y') + Z c (u'), 

where 



Z^u') 
Zb(v) 



25 10v' I 5v' 1 
4 V 7T V 3 _ 12 



Z c (u') = 




125^5 , f-9v' 



12 , V 6XP V 2 



and 



v = (A C /5 C ) 2 v = {Xc/Sc) 2 {Sc/uY 



(Al) 



© 0000 RAS, MNRAS 000, 000-000 



8 Ravi K. Sheth & Giuseppe Tormen 



(Lee & Shandarin 1998). In these variables, the mass func- 
tion has two free parameters: the usual spherical collapse 
5c, and the ratio (A C /<5 C ). The dot-dashed curve in Fig. ^ 
shows this function with the value used by Lee & Shandarin: 
A c = 0.38. 

One way to approximate the associated conditional 
mass function is to use the peak background split discussed 
in the main text. In this approximation, one simply sets 



(A c i — A c o) 2 



v'i (1 — S c o/S c i) 2 



into the expression for f(u) du above. Although we have not 
done so here, in principle, one could also allow A c (z, fio, Ao). 
In our case, the associated Lagrangian large scale bias is 
given by equation (W), so 



#(1|0) 



So 
Si 



5u[-l-2Z a (u' 1 ) 



20uj 
20v{ - 



+j {2 V [ Z h {v[) - Z c (v[) 



bilgim,!, Si) S . 



(A2) 



The Eulerian space peak background split bias factor for the 
Zeldovich approximation is given by inserting this expression 
for &La g in equation (fjj): 



&iui("ii,<5i) = 1 + &Lag(rm,<5i). 



(A3) 
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